skip to main content
US FlagAn official website of the United States government
dot gov icon
Official websites use .gov
A .gov website belongs to an official government organization in the United States.
https lock icon
Secure .gov websites use HTTPS
A lock ( lock ) or https:// means you've safely connected to the .gov website. Share sensitive information only on official, secure websites.


Search for: All records

Creators/Authors contains: "Yin, John"

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

  1. Synopsis Viral evolution unfolds across nested layers of adaptation, much like a set of Matryoshka dolls. The outermost, well-studied layer involves interactions between viruses and their hosts—where immune evasion, cross-species transmission, and long-term coevolution drive viral diversification. Yet, hidden within this framework is an often-overlooked inner layer: the coevolution of viruses with their own molecular parasites, defective interfering (DI) particles, and defective viral genomes (DVGs). These molecular parasites exploit viral replication machinery, reshaping infection dynamics and imposing selective pressures that influence viral fitness, transmission, and persistence. This perspective synthesizes evidence from experimental evolution, mathematical modeling, and molecular virology to propose a more integrated view of viral evolution. By framing host–virus interactions and virus-DI particle dynamics within a unified evolutionary framework, we highlight the underappreciated role of DI particles as evolutionary players, not just aberrant byproducts. Recognizing these internal layers of viral evolution may inform the development of antiviral strategies and broader questions in host–pathogen coevolution. 
    more » « less
  2. Abstract Deletions are prevalent in the genomes of SARS-CoV-2 isolates from COVID-19 patients, but their roles in the severity, transmission, and persistence of disease are poorly understood. Millions of COVID-19 swab samples from patients have been sequenced and made available online, offering an unprecedented opportunity to study such deletions. Multiplex PCR-based amplicon sequencing (amplicon-seq) has been the most widely used method for sequencing clinical COVID-19 samples. However, existing bioinformatics methods applied to negative control samples sequenced by multiplex-PCR sequencing often yield large numbers of false-positive deletions. We found that these false positives commonly occur in short alignments, at low frequency and depth, and near primer-binding sites used for whole-genome amplification. To address this issue, we developed a filtering strategy, validated with positive control samples containing a known deletion. Our strategy accurately detected the known deletion and removed more than 99% of false positives. This method, applied to public COVID-19 swab data, revealed that deletions occurring independently of transcription regulatory sequences were about 20-fold less common than previously reported; however, they remain more frequent in symptomatic patients. Our optimized approach should enhance the reliability of SARS-CoV-2 deletion characterization from surveillance studies. Finally, our approach may guide the development of more reliable bioinformatics pipelines for genome sequence analyses of other viruses. 
    more » « less
    Free, publicly-accessible full text available April 16, 2026
  3. Abstract Viruses persist in nature owing to their extreme genetic heterogeneity and large –population sizes, which enable them to evade host immune defenses, escape anti-viral drugs, and adapt to new hosts. The persistence of viruses is challenging to study because mutations affect multiple virus genes, interactions among genes in their impacts on virus growth are seldom known, and measures of viral fitness have yet to be standardized. To address these challenges, we employed a data-driven computational model of cell infection by a virus. The infection model accounted for the kinetics of viral gene expression, functional gene-gene interactions, genome replication, and allocation of host cellular resources to produce progeny of vesicular stomatitis virus (VSV), a prototype RNA virus. We used this model to computationally probe how interactions among genes carrying up to 11 deleterious mutations affect different measures of virus fitness: single-cycle growth yields and multi-cycle rates of infection spread. Individual mutations were implemented by perturbing biophysical parameters associated with individual gene functions of the wild-type model. Our analysis revealed synergistic epistasis among deleterious mutations in their effects on virus yield; so adverse effects of single deleterious mutations were amplified by interaction. For the same mutations, multi-cycle infection spread indicated weak or negligible epistasis, where single mutations act alone in their effects on infection spread. These results were robust to simulation under high and low host resource environments. Our work highlights how different types and magnitudes of epistasis can arise for genetically identical virus variants, depending on the fitness measure. More broadly, gene-gene interactions can differently affect how viruses grow and spread. 
    more » « less
  4. Virus-like particles resemble infectious virus particles in size, shape, and molecular composition; however, they fail to productively infect host cells. Historically, the presence of virus-like particles has been inferred from total particle counts by microscopy, and infectious particle counts or plaque-forming-units (PFUs) by plaque assay; the resulting ratio of particles-to-PFUs is often greater than one, easily 10 or 100, indicating that most particles are non-infectious. Despite their inability to hijack cells for their reproduction, virus-like particles and the defective genomes they carry can exhibit a broad range of behaviors: interference with normal virus growth during co-infections, cell killing, and activation or inhibition of innate immune signaling. In addition, some virus-like particles become productive as their multiplicities of infection increase, a sign of cooperation between particles. Here, we review established and emerging methods to count virus-like particles and characterize their biological functions. We take a critical look at evidence for defective interfering virus genomes in natural and clinical isolates, and we review their potential as antiviral therapeutics. In short, we highlight an urgent need to better understand how virus-like genomes and particles interact with intact functional viruses during co-infection of their hosts, and their impacts on the transmission, severity, and persistence of virus-associated diseases. 
    more » « less
  5. Abstract Viruses are the most numerically abundant biological entities on Earth. As ubiquitous replicators of molecular information and agents of community change, viruses have potent effects on the life on Earth, and may play a critical role in human spaceflight, for life-detection missions to other planetary bodies and planetary protection. However, major knowledge gaps constrain our understanding of the Earth's virosphere: (1) the role viruses play in biogeochemical cycles, (2) the origin(s) of viruses and (3) the involvement of viruses in the evolution, distribution and persistence of life. As viruses are the only replicators that span all known types of nucleic acids, an expanded experimental and theoretical toolbox built for Earth's viruses will be pivotal for detecting and understanding life on Earth and beyond. Only by filling in these knowledge and technical gaps we will obtain an inclusive assessment of how to distinguish and detect life on other planetary surfaces. Meanwhile, space exploration requires life-support systems for the needs of humans, plants and their microbial inhabitants. Viral effects on microbes and plants are essential for Earth's biosphere and human health, but virus–host interactions in spaceflight are poorly understood. Viral relationships with their hosts respond to environmental changes in complex ways which are difficult to predict by extrapolating from Earth-based proxies. These relationships should be studied in space to fully understand how spaceflight will modulate viral impacts on human health and life-support systems, including microbiomes. In this review, we address key questions that must be examined to incorporate viruses into Earth system models, life-support systems and life detection. Tackling these questions will benefit our efforts to develop planetary protection protocols and further our understanding of viruses in astrobiology. 
    more » « less
  6. The Ty1 retrotransposon family is maintained in a functional but dormant state by its host, Saccharomyces cerevisiae . Several hundred RHF and RTT genes encoding co-factors and restrictors of Ty1 retromobility, respectively, have been identified. Well-characterized examples include MED3 and MED15 , encoding subunits of the Mediator transcriptional co-activator complex; control of retromobility by Med3 and Med15 requires the Ty1 promoter in the U3 region of the long terminal repeat. To characterize the U3-dependence of other Ty1 regulators, we screened a library of 188 known rhf and rtt mutants for altered retromobility of Ty1 his3AI expressed from the strong, TATA-less TEF1 promoter or the weak, TATA-containing U3 promoter. Two classes of genes, each including both RHF s and RTT s, were identified. The first class comprising 82 genes that regulated Ty1 his3AI retromobility independently of U3 is enriched for RHF genes that restrict the G1 phase of the cell cycle and those involved in transcriptional elongation and mRNA catabolism. The second class of 51 genes regulated retromobility of Ty1 his3AI driven only from the U3 promoter. Nineteen U3-dependent regulators (U3DRs) also controlled retromobility of Ty1 his3AI driven by the weak, TATA-less PSP2 promoter, suggesting reliance on the low activity of U3. Thirty-one U3DRs failed to modulate P PSP2 -Ty1 his3AI retromobility, suggesting dependence on the architecture of U3. To further investigate the U3-dependency of Ty1 regulators, we developed a novel fluorescence-based assay to monitor expression of p22-Gag, a restriction factor expressed from the internal Ty1i promoter. Many U3DRs had minimal effects on levels of Ty1 RNA, Ty1i RNA or p22-Gag. These findings uncover a role for the Ty1 promoter in integrating signals from diverse host factors to modulate Ty1 RNA biogenesis or fate. 
    more » « less
  7. null (Ed.)
    Abstract Although viruses in their natural habitats add up to less than 10% of the biomass, they contribute more than 90% of the genome sequences [1]. These viral sequences or ‘viromes’ encode viruses that populate the Earth’s oceans [2, 3] and terrestrial environments [4, 5], where their infections impact life across diverse ecological niches and scales [6, 7], including humans [8–10]. Most viruses have yet to be isolated and cultured [11–13], and surprisingly few efforts have explored what analysis of available data might reveal about their nature. Here, we compiled and analyzed seven decades of one-step growth and other data for viruses from six major families, including their infections of archaeal, bacterial and eukaryotic hosts [14–191]. We found that the use of host cell biomass for virus production was highest for archaea at 10%, followed by bacteria at 1% and eukarya at 0.01%, highlighting the degree to which viruses of archaea and bacteria exploit their host cells. For individual host cells, the yield of virus progeny spanned a relatively narrow range (10–1000 infectious particles per cell) compared with the million-fold difference in size between the smallest and largest cells. Furthermore, healthy and infected host cells were remarkably similar in the time they needed to multiply themselves or their virus progeny. Specifically, the doubling time of healthy cells and the delay time for virus release from infected cells were not only correlated (r = 0.71, p < 10−10, n = 101); they also spanned the same range from tens of minutes to about a week. These results have implications for better understanding the growth, spread and persistence of viruses in complex natural habitats that abound with diverse hosts, including humans and their associated microbes. 
    more » « less
  8. Abstract Why do some biological systems and communities persist while others fail? Robustness, a system's stability, and resilience, the ability to return to a stable state, are key concepts that span multiple disciplines within and outside the biological sciences. Discovering and applying common rules that govern the robustness and resilience of biological systems is a critical step toward creating solutions for species survival in the face of climate change, as well as the for the ever-increasing need for food, health, and energy for human populations. We propose that network theory provides a framework for universal scalable mathematical models to describe robustness and resilience and the relationship between them, and hypothesize that resilience at lower organization levels contribute to robust systems. Insightful models of biological systems can be generated by quantifying the mechanisms of redundancy, diversity, and connectivity of networks, from biochemical processes to ecosystems. These models provide pathways towards understanding how evolvability can both contribute to and result from robustness and resilience under dynamic conditions. We now have an abundance of data from model and non-model systems and the technological and computational advances for studying complex systems. Several conceptual and policy advances will allow the research community to elucidate the rules of robustness and resilience. Conceptually, a common language and data structure that can be applied across levels of biological organization needs to be developed. Policy advances such as cross-disciplinary funding mechanisms, access to affordable computational capacity, and the integration of network theory and computer science within the standard biological science curriculum will provide the needed research environments. This new understanding of biological systems will allow us to derive ever more useful forecasts of biological behaviors and revolutionize the engineering of biological systems that can survive changing environments or disease, navigate the deepest oceans, or sustain life throughout the solar system. 
    more » « less